Comparative evaluation of CASA and BSS models for subband cocktail-party speech separation

Author

  • Frédéric Berthommier
Abstract

For speech segregation, a recurrent blind source separation (BSS) model is tested together with a Computational Auditory Scene Analysis (CASA) model, which is based on the localisation cue and the evaluation of the Time Delay Of Arrival (TDOA). The test database is composed of 332 binary mixture sentences recorded in stereo with a static set-up. These are truncated at 1 second for the simulations. To apply the two models, we divide the frequency domain into a variable number of subbands, which are processed independently. We then evaluate the gain, using reference signals recorded in isolation. After a careful analysis, we find similar gains of about 2-3 dB for both methods. Varying the number of subbands allows an optimisation: we obtain a significant peak at 4 subbands for the CASA model and a maximum at 2 subbands for the BSS model.
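As a rough illustration of two quantities named in the abstract, the sketch below estimates a TDOA as the integer lag that maximises the cross-correlation between two channels, and computes a separation gain in dB against a reference signal recorded in isolation. This is a minimal sketch under stated assumptions, not the paper's method: the function names (`estimate_tdoa`, `snr_db`, `separation_gain_db`) are hypothetical, and defining the gain as the SNR after separation minus the SNR of the raw mixture is an assumption, since the abstract does not give the exact definition.

```python
import math


def estimate_tdoa(left, right, max_lag):
    """Return the lag (in samples) that maximises sum(left[i] * right[i + lag]).

    A positive result means the right channel lags the left channel.
    Plain cross-correlation is an assumption; the paper may use another estimator.
    """
    n = min(len(left), len(right))
    best_lag, best_score = 0, -math.inf
    for lag in range(-max_lag, max_lag + 1):
        score = 0.0
        for i in range(n):
            j = i + lag
            if 0 <= j < n:
                score += left[i] * right[j]
        if score > best_score:
            best_lag, best_score = lag, score
    return best_lag


def snr_db(reference, estimate):
    """Signal-to-noise ratio in dB of `estimate` measured against `reference`."""
    signal = sum(r * r for r in reference)
    noise = sum((e - r) ** 2 for r, e in zip(reference, estimate))
    if noise == 0:
        return math.inf
    return 10.0 * math.log10(signal / noise)


def separation_gain_db(reference, mixture, separated):
    """Assumed gain definition: SNR after separation minus SNR of the mixture."""
    return snr_db(reference, separated) - snr_db(reference, mixture)


# Toy usage: a unit pulse delayed by 3 samples on the right channel.
left = [0.0] * 32
right = [0.0] * 32
left[10] = 1.0
right[13] = 1.0
tdoa = estimate_tdoa(left, right, max_lag=8)

# Toy usage: halving the interference amplitude on a short signal.
reference = [1.0, 0.0, -1.0, 0.0]
mixture = [r + 0.5 for r in reference]
separated = [r + 0.25 for r in reference]
gain = separation_gain_db(reference, mixture, separated)
```

In a subband setting such as the one the abstract describes, one would presumably apply the same measurements to each band-filtered signal independently; the filterbank itself is not sketched here because the abstract does not specify it.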


Related resources

Comparative evaluation of CASA and BSS models for subband cocktail-party speech separation

For speech segregation, a blind separation model (BSS) is tested together with a CASA model which is based on the localisation cue and the evaluation of the time delay of arrival (TDOA). The test database is composed of 332 binary mixture sentences recorded in stereo with a static set-up. These are truncated at 1 second for the simulations. For applying the two models, we cut the frequency doma...


A CASA Front-end Using the Localisation Cue for Segregation and Then Cocktail-party Speech Recognition

We propose and test a cocktail-party recognition technique based on segregation applied before recognition. This CASA front-end uses the TDOA (Time Delay Of Arrival) evaluated within subbands in order to determine the Relative Level (RL) of two competing speech sources. To perform the evaluation of the model, we have recorded a stereo database ST-NB95 from the mono Numbers95. This is composed o...


A CASA-labelling model using the localisation cue for robust cocktail-party speech recognition

We propose a new cocktail-party recognition technique based on the coupling of a CASA-labelling method using the TDOA (Time Delay Of Arrival) with multistream recognition. This is an alternative to the classical "segregate and recognise" architecture. First, we have recorded a stereo database ST-NB95 from the mono Numbers95. This is composed of binary mixtures of sentences at 0dB, placed left a...


Evaluation of CASA and BSS models for subband cocktail-party speech separation

For speech segregation, a recurrent blind separation model (BSS) is tested together with a CASA model, which is based on the localisation cue and the evaluation of the time delay of arrival (TDOA). The test database is composed of 332 binary mixture sentences recorded in stereo with a static set-up. These are truncated at 1 second for the simulations. For applying the two models, we cut the fre...


Measured Performance for Real-Time Localization of Cocktail-Party Talkers

Improvements in hardware, software and algorithms have made the use of a large-aperture microphone array cost effective. In this paper we present real, measured results for our wired, 128-microphone array that surrounds a focal area (room) of about 7 m x 5 m. While it was necessary to evaluate the performance of the array offline using the array's recording feature, we ensured that all the...




Publication date: 2002